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FOREWORD 


Opinions,  interpretations,  conclusions  and  recommendations  are 
those  of  the  author  and  are  not  necessarily  endorsed  by  the  U.S. 
Army. 

_  Where  copyrighted  material  is  quoted,  permission  has  been 

obtained  to  use  such  material. 

_  Where  material  from  documents  designated  for  limited 

distribution  is  quoted,  permission  has  been  obtained  to  use  the 
material . 

_ _  Citations  of  commercial  organizations  and  trade  nainec  in 

this  report  do  not  constitute -an  of ficial  Department  of  Army 
endorsement  or  approval  of  the  products  or  services  of  these 
organizations. 

_  In  conducting  research  using  animals,  the  investigator (s) 

adhered  to  the  "Guide  for  the  Care*  and  Use  of  Laboratory 
Animals,"  prepared  by  the  Committee  on  Care  and  use  of 
Laboratory  Animals  of  the  Institute  of  Laboratory  Resources, 
national  Research  Council  (NIH  Publication  No.  86-23,  Revised 
1985) . 

_  For  the  protection  of  human  subjects,  the  investigator (s) 

adhered  to  policies  of  applicable  Federal  Law  45  CFR  46. 

%_A_  jn  conducting  research  utilizing  recombinant  DNA 
technology,  the  investigator (s)  adhered  to  current  guidelines 
promulgated  by  the  National  Institutes  of  Health. 

&A.  in  the  conduct  of  research  utilizing  recombinant  DNA,  the 
investigator (s)  adhered  to  the  NIH  Guidelines  for  Research 
Involving  Recombinant  DNA  Molecules . 

_  In  the  conduct  of  research  involving  hazardous  organisms, 

the  investigator (s)  adhered  to  the  CDC-NIH  Guide  for  Biosafety 
in  Microbiological  and  Biomedical  Laboratories. 
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5.  Introduction 


The  Rho  family  GTPases  Rho,  Rac  and  Cdc42  regulate  a  diverse  array 
of  cellular  processes  such  as  regulation  of  the  actin  cytoskeleton, 
gene  expression  and  cell  cycle  progression.  Not  surprisingly, 
aberrant  Rho  signaling  can  contribute  to  many  disease  processes, 
including  tumorigenesis,  tumor  metastasis  and  developmental 
disorders.  Activation  of  Rho  proteins,  through  release  of  bound  GDP 
and  subsequent  binding  of  GTP,  is  catalyzed  by  guanine  nucleotide 
exchange  factors  (GEFs)  in  the  Dbl  family.  All  members  of  this  group 
of  over  30  molecules,  most  of  which  were  initially  identified  based 
on  their  oncogenic  potential,  share  a  sequence  of  approximately  200 
residues,  termed  the  Dbl-homology  (DH)  domain.  In  order  to  provide 
a  molecular  framework  for  understanding  DH-mediated  nucleotide 
exchange  activity,  specificity  and  regulation,  we  have  determined  the 
solution  structure  of  the  DH  domain  of  human  (3PIX.  Mutagenic 
studies  of  the  DH  domains  of  Dbl,  the  archetypal  member  of  the  Dbl 
family,  and  UNC-73,  a  Dbl  protein  regulating  axon  guidance  in  C. 
elegans,  together  with  sequence  alignments,  identify  the  probable 
GTPase  interaction  site.  Future  research  will  be  aimed  at  obtaining  a 
molecular  basis  for  the  interaction  of  DH  domains  with  Rho-GTPases. 
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6.  Body 


The  Rho  family  GTPases  Rho,  Rac  and  Cdc42  regulate  a  diverse 
array  of  cellular  processes  including  regulation  of  the  actin 
cytoskeleton,  gene  expression  and  cell  cycle  progression,  and 
aberrant  Rho  signaling  can  contribute  to  many  disease  processes, 
including  tumorigenesis,  tumor  metastasis  and  developmental 
disorders. 

Activation  of  Rho-GTPases  has  been  linked  to  the  Dbl  homology 
(DH)  domain  which  was  first  discovered  in  the  diffuse-B-cell 
lymphoma  in  1985  and  subsequently  in  over  30  molecules  (mostly  in 
assays  identifying  oncogenes)  sharing  this  common  domain  of 
approximately  200  residues.  Although  a  vast  array  of  biochemical 
data  exists,  the  3-dimensional  structure  had  yet  to  be  elucidated  in 
order  to  explain  this  data,  including  issues  relating  to  activity  toward 
GTPases  specificity  and  mutangenesis  rendering  many  of  these 
molecules  oncogenic. 

We  have  determined  the  solution  structure  of  the  DH  domain  of 
(3Pix,  a  recently  identified  member  of  this  family.  Furthermore  we 
were  able  to  map  the  GTPase  interaction  site  based  on  structure- 
based  amino  acid  sequence  alignment.  Knowledge  of  the  structure 
also  guided  our  mutagenesis  studies  which  led  to  the  biochemical 
verification  of  the  active  site  and  the  identification  of  key  residues 
involved  in  GEF  function. 

Structure  determination  of  the  DH  domain  by  NMR 
spectroscopy  posed  a  significant  challenge.  There  have  been  to  date 
no  NMR  structures  of  molecules  of  such  relatively  large  size  (24kDa) 
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without  prior  knowledge  of  the  X-ray  structure.  Several  factors 
including  the  size  and  helical  nature  of  the  molecule  complicated 
every  step  significantly. 

In  order  to  overcome  the  difficulties,  we  devised  a  strategy 
involving  state-of-the-art  labeling  schemes  to  obtain  both  chemical 
shift  and  NOE  assignments  from  NMR  experiments  especially 
designed  for  these  labeling  patterns.  This  was  the  first  time  the  full 
scope  of  the  latest  biochemical  and  spectroscopic  advances  were 
employed  in  determining  an  unknown  structure.  Furthermore,  we 
had  to  employ  the  latest  structure  calculation  protocols  (CNS) 
together  with  an  automated  NOE  assignment  program  (ARIA)  in 
order  to  obtain  converging  structures  and  be  able  to  further  refine 
the  structure.  The  final  structure  is  of  high  quality  with  a  root- 
mean-square-deviation  for  an  ensemble  of  20  structures  of  0.79  A 
for  the  helices  only  and  1.09  A  for  the  structured  portion  of  the 
molecule. 

The  tandem  occurence  of  the  DH  domain  with  pleckstrin 
homology  (PH)  domains  suggests  that  the  Dbl  family  PH  domains  may 
also  play  a  more  specific  role  in  regulating  GEF  activity.  We  do  not 
however  observe  any  structural  interactions  between  these  domains 
of  pPix  in  solution. 

Structure  and  sequence  homolgy  directed  mutagenesis  on  a 
variety  of  DH  constructs  enabled  us  to  identify  regions  of  the 
molecule  involved  in  GTPase  nucleotide  exchange.  Both  NMR 
spectroscopic  and  biochemical  nucleotide  exchange  assays  were 
performed. 
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Based  on  this  information,  we  believe  to  be  able  to  explain  the 
low  exchange  activity  of  pPix  toward  GTPases.  Several  residues  in 
the  proposed  GTPase  interaction  surface  that  are  conserved  in  the  DH 
family  are  not  conserved  in  pPix.  Future  focus  will  be  directed  to 
determine  the  relative  significance  of  these  residues  in  mediating 
nucleotide  exchange. 

Knowledge  of  the  structure  of  the  DH  domain  will  also  help  us 
in  investigating  the  regulation  of  these  molecules,  the  loss  of  which 
renders  these  molecules  oncogenic. 
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7.  Appendix 

Research  accomplishments 

•  The  solution  structure  of  the  (3Pix  DH  domain  has  been  determined 
by  multi-dimensional  NMR  spectroscopy. 

•  The  structure  forms  an  elongated  bundle  of  a-helices,  bringing 
together  several  regions  of  high  sequence  conservation. 

•  The  probable  GTPase  interaction  site  has  been  determined  to  lie 
on  one  face  of  the  protein  defined  by  three  conserved  helices. 


Publications 

Aghazadeh,  B.,  Zhu,  K.,  Kubiseski,  T.J.,  Liu,  G.A.,  Pawson,  T.,  Zheng,  Y.  & 
Rosen,  M.K.  Structure  and  Mutagenesis  of  the  Dbl  Homology  Domain. 
Nature  Structural  Biology  5  (12),  1098-1107  (1998). 
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Structure  and  mutagenesis  of  the  Dbl 
homology  domain  ; 

Behzad  Aghazadeh1-2,  Kejin  Zhu4,  Terrance  J.  Kubiseski5,  Grace  A.  Liu1,  Tony  Pawson5,  Yi  Zheng4  and 
Michael  K.  Rosen1-3 


Guanine  nucleotide  exchange  factors  in  the  Dbl  family  activate  Rho  GTPases  by  accelerating  dissociation  of 
bound  GDP,  promoting  acquisition  of  the  GTP-bound  state.  Dbl  proteins  possess  a  -200  residue  catalytic  Dbl- 
homology  (DH)  domain,  that  is  arranged  in  tandem  with  a  C-terminal  pleckstrin  homology  (PH)  domain  in  nearly 
all  cases.  Here  we  report  the  solution  structure  of  the  DH  domain  of  human  PAK-interacting  exchange  protein 
(pPIX).  The  domain  is  composed  of  1 1  a-helices  that  form  a  flattened,  elongated  bundle.  The  structure  explains  a 
large  body  of  mutagenesis  data,  which,  along  with  sequence  comparisons,  identify  the  GTPase  interaction  site 
as  a  surface  formed  by  three  conserved  helices  near  the  center  of  one  face  of  the  domain.  Proximity  of  the  site 
to  the  DH  C-terminus  suggests  a  means  by  which  PH-ligand  interactions  may  be  coupled  to  DH-GTPase 
interactions  to  regulate  signaling  through  the  Dbl  proteins  in  vivo. 


The  Rho  family  GTPases  Rho,  Rac  and  Cdc42  regulate  a  diverse 
array  of  cellular  processes1.  Best  understood  as  regulators  of  the 
actin  cytoskeieton,  each  member  of  the  family  is  capable  of 
inducing  distinct  actin-based  structures  upon  activation.  Cdc42 
can  induce  formation  of  fillipodia  and  focal  complexes2-3;  Rac, 
formation  of  lamellipodia  and  focal  complexes3-4;  and  Rho, 
actin  stress  fibers  and  focal  adhesions5.  Rho  proteins  also  regu¬ 
late  gene  expression6-10  and  cell  cycle  progression6,  implicating 
them  in  control  of  cell  growth.  Not  surprisingly,  aberrant  Rho 
signaling  can  contribute  to  many  disease  processes,  including 
tumorigenesis11,  tumor  metastasis12  and  developmental  disor¬ 
ders13-14. 

Like  all  members  of  the  Ras  superfamily,  the  Rho  proteins 
cycle  between  active  GTP-bound  and  inactive  GDP-bound  con¬ 
formational  states.  In  this  way,  they  function  as  molecular 
switches  in  signaling  pathways,  controlling  information  transfer 
from  cell  surface  receptors  to  intracellular  machinery15.  The 
nucleotide  binding  and  hydrolysis  cycle  is  intrinsically  slow,  and 
is  regulated  in  vivo  by  a  variety  of  proteins  that  serve  to  acceler¬ 
ate  or  block  different  steps  in  the  process16-17.  Activation  of  Rho 
proteins,  through  release  of  bound  GDP  and  subsequent  bind¬ 
ing  of  GTP,  is  catalyzed  by  guanine  nucleotide  exchange  factors 
(GEFs)  in  the  Dbl  family18.  All  members  of  this  group  of  over  30 
molecules  share  a  sequence  of  -200  residues,  termed  the  Dbl- 
homology  (DH)  domain  (Fig.  I)19-21.  While  biochemical  studies 
have  revealed  that  GEF  activity  resides  in  the  DH  domain,  they 
have  also  found  wide  variation  in  the  GTPase  binding  and  cat¬ 
alytic  behavior  of  individual  Dbl  proteins.  Some  family  mem¬ 
bers  have  catalytic  activity  only  toward  a  single  GTPase,  such  as 
the  Cdc42  specific  FGD122  and  the  Rac  specific  UNC-73B23. 
Others  act  on  a  broader  range  of  targets,  such  as  Dbl20  and  Ost24 
which  both  activate  Cdc42  and  Rho.  Still  others  (such  as  Ect-2) 
bind  GTPases  (Rho  and  Rac)  but  have  no  detectable  GEF  activi¬ 
ty  toward  currently  known  Rho  proteins25.  Lastly,  and  most 
unusual  from  a  mechanistic  perspective,  some  DH  domains  are 


GEFs  for  certain  GTPases,  but  bind  others  without  causing 
nucleotide  exchange.  These  include  Lfc,  which  activates  Rho  but 
binds  Rac  as  well26,  and  Ost  which  activates  Cdc42  and  Rho  but 
binds  Rac  in  a  GTP-dependent  manner24.  This  range  of  behavior 
has  led  to  speculation  that  DH  domains  may  not  function  exclu¬ 
sively  as  GEFs  in  the  cell,  but  could  also  play  roles  in  recruiting 
GTPases  to  specific  locations  or  to  specific  molecular  targets26. 

Mechanistic  details  of  DH-mediated  GEF  catalysis  are  poorly 
understood.  However,  it  is  generally  believed  that  DH  domains 
will  function  analogously  to  other,  better  characterized 
exchange  factors  such  as  RCC127  and  the  Cdc25  domain,  which 
act  on  Ran  and  Ras  respectively.  In  these  cases,  detailed  kinetic 
and  thermodynamic  analyses  have  demonstrated  that  GEF 
binding  decreases  GTPase  nucleotide  affinity  by  several  orders 
of  magnitude,  principally  by  effecting  an  increase  in  dissocia¬ 
tion  rate.  GEF  complexation  thus  allows  nucleotide  state  to 
equilibrate  with  conditions  in  the  cell;  as  cellular  GTP  concen¬ 
tration  is  typically  10-fold  higher  than  GDP  concentration 
(lO^1  M  versus  10-5  M),  equilibration  results  in  GTPase  activa¬ 
tion27.  The  recently  reported  structures  of  the  Cdc25  and  SEC7 
GEF  domains  bound  to  their  GTPase  substrates  Ras  and  Arf 
respectively,  have  demonstrated  that  these  effects  result  from  a 
complex  series  of  interactions,  involving  large  rearrangements 
of  the  switch  I  and  II  regions  of  the  GTPase  and  specific  interac¬ 
tions  with  the  phosphate  and  Mg2+  binding  sites.  It  is  not  yet 
clear  if  the  principles  that  have  emerged  from  these  structures 
will  be  generalizable  to  the  actions  of  DH  domains  toward  Rho 
family  GTPases. 

In  virtually  all  Dbl  proteins,  the  DH  domain  is  immediately 
followed  in  sequence  by  a  -100  residue  pleckstrin  homology 
(PH)  domain,  a  widely  distributed  signaling  module  often 
involved  in  intracellular  membrane  targeting28-29.  A  variety  of 
studies  indicate  that  the  PH  domain  is  necessary  for  proper 
intracellular  localization  and  function  of  Dbl  proteins.  In  this 
sense  PH  domain  interactions  in  Rho  signaling  pathways  could 
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Fig.  1  Structure-based  amino  acid  sequence  alignment  (manually  modified  Clusta!)  of  selected  Dbl  homology  domains.  Residues  appearing  in  three 
or  more  sequences  are  boxed  in  red.  Consensus  secondary  structure  of  the  (3PIX  DH  domain  as  determined  by  Procheck-NMR79  is  shown  above  the 
sequence.  Residues  whose  mutation  resulted  in  inactivation  of  Dbl  or  UNC-73  are  indicated  below  the  alignment  by  a  M.  Those  whose  mutation  had 
no  effect  on  the  GEF  activity  of  Dbl  are  indicated  by  a  *. 


function  analogously  to  SH2  and  SH3  domain  interactions  in 
the  Ras  pathway,  to  activate  GEFs  by  recruiting  them  to  mem¬ 
branes,  where  the  prenylated  GTPases  reside28.  However,  the 
stereotypical  pairing  of  DH  with  PH  suggests  that  the  Dbl  family 
PH  domains  may  also  play  a  more  specific  role  in  regulating  GEF 
activity.  This  could  include  direct  contribution  of  binding  or  cat¬ 
alytic  residues,  allosteric  modulation  of  catalytic  activity,  recruit¬ 
ment  of  additional  regulatory  factors  or  precise  positioning  with 
respect  to  GTPases  following  membrane  targeting. 

In  order  to  provide  a  molecular  framework  for  understanding 
DH-mediated  nucleotide  exchange  activity,  specificity  and  regu¬ 
lation,  we  have  determined  the  solution  structure  of  the  DH 
domain  of  human  (3PIX  (also  referred  to  as  pSS0001-1,  ref.  30)  a 
recently  reported  member  of  the  Dbl  family31.  The  structure 
forms  an  elongated  bundle  of  a-helices,  bringing  together  several 
regions  of  high  sequence  conservation.  Mutagenic  studies  of  the 
DH  domains  of  Dbl,  the  archetypal  member  of  the  Dbl  family, 
and  UNC-73,  a  Dbl  protein  regulating  axon  guidance  in  C.  ele- 
gans 23,  together  with  sequence  alignments,  identify  the  probable 
GTPase  interaction  site  as  a  flat  surface  on  one  face  of  the  protein 
defined  by  three  conserved  helices.  Although  the  DH  and  PH 
domains  of  a  pPIX  tandem  construct  do  not  interact  in  solution, 
proximity  of  the  GTPase  contact  site  to  the  DH  C-terminus  sug¬ 
gests  that  such  an  interaction  could  occur  in  other  Dbl  proteins 
having  complementary  DH  and  PH  surfaces. 


Structure  determination 

pPIX  possesses  a  unique  splice  variant  in  which  the  357  amino 
acids  C-terminal  to  Lys  289,  including  the  PH  domain  and  much 
of  the  interdomain  linker,  have  been  replaced  with  -25  residues 
of  apparently  random  sequence  (S.  Bagrodia  and  R.  A.  Cerione, 
unpublished  results).  The  construct  used  in  the  structure  deter¬ 
mination  was  based  on  this  variant,  and  spans  residues  82-289  of 
the  protein. 

Structure  determination  of  the  DH  domain  by  NMR  spec¬ 
troscopy  represented  a  significant  challenge.  The  relatively  large 
size  of  the  protein  (24,000  Afr)  coupled  with  its  elongated  shape 
(see  below)  resulted  in  short  T2  relaxation  times  and  correspond¬ 
ingly  low  sensitivity  in  many  experiments  typically  used  to  make 
chemical  shift  assignments.  Moreover,  the  paucity  of  aromatic 
residues  (six  phenylalanines,  nine  tyrosines,  0  tryptophans)  and 
overabundance  of  leucine  residues  (29,  versus  nine  valines  and 
eight  isoleucines),  exacerbated  the  already  poor  chemical  shift 
dispersion  resulting  from  the  helical  nature  of  the  protein. 

In  order  to  overcome  these  difficulties,  extensive  use  was  made 
of  selective  deuteration  strategies  to  obtain  both  chemical  shift 
and  NOE  assignments.  Nearly  complete  backbone  XHN,  15N, 
13CO,  13Ca  (98%)  and  l3Cp  (98%)  sequential  assignments  were 
achieved  through  analyses  of  2H/15N/l3C-labeled  samples  (see  the 
Methods;  note  that  all  amide  positions  in  deuterated  samples  are 
fully  protonated  due  to  solvent  exchange  during  purification). 
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Fig.  2  Methyl  region  of  a  constant  time  1 H/13C  HSQC  spectrum  of  the  (3PIX 
DH  domain  recorded  at  38  °C  on  the  selectively  methyl/aromatic  proto- 
nated  sample  described  in  the  text. 


Many  side  chain  assignments  were  made  through  a  combination 
of  HCC-  and  CCC-TOCSY-NNH32  experiments  recorded  on  a 
2H(50%)/15N/I3C-Iabeled  sample,  and  HCCH-TOCSY33  experi¬ 
ments  recorded  on  15N/13C  samples.  However,  due  to  poor  trans¬ 
fer  efficiency  and  extensive  methyl  chemical  shift  degeneracy 
(Fig.  2),  these  spectra  yielded  very  few  assignments  of  the  critical 
leucine  and  isoleucine  methyl  groups  necessary  to  define  the 
hydrophobic  core  of  the  protein. 

Key  to  the  structure  determination  process  was  a  sample  labeled 
uniformly  with  i5N,  and  with  at  the  methyl  groups  of  Val, 

Leu  and  lie  (8  only)  residues,  !H/12C  at  Phe  and  Tyr  residues,  and 
2H/13C  at  all  other  carbon  sites34-38.  Methyl  optimized  HCC-  and 
CCC-TOCSY-NNH  spectra39  recorded  on  this  sample  provided 
rapid,  unambiguous  assignments  of  all  valine,  leucine  and 
isoleucine  (5  only)  methyl  proton  and  carbon  chemical  shifts.  In 
addition,  the  long  relaxation  times  afforded  by  high  level  deutera- 
tion  enabled  acquisition  of  high-sensitivity  and  high-resolution 
homonuclear  ^-TOCSY  experiments  to  obtain  complete  assign¬ 
ments  of  aromatic  ring  protons,  which  were  then  connected  to  the 
backbone  using  traditional  NOE-based  methods40. 

The  selective  methyl/aromatic  labeled  sample  was  also  critical 
to  determination  of  the  global  fold  of  the  DH  domain. 
Specifically,  the  narrow  aliphatic  proton  and  carbon  chemical 
shift  ranges  (1.4  p.p.m.  and  15  p.p.m.  respectively)  afforded  by 
the  labeling  pattern  facilitated  acquisition  of  a  series  of  high  reso¬ 
lution  and  high  sensitivity  NOESY  spectra  to  obtain  long-range 
methyl  and  aromatic  NOEs  defining  the  hydrophobic  core.  Thus 
a  homonuclear  2D  NOESY  spectrum  afforded  24  aromatic-aro¬ 
matic  and  11  aromatic -methyl  NOE  assignments,  and  a  4D  I3C- 
I3C  spectrum  afforded  81  methyl-methyl  NOE  assignments  based 
entirely  on  chemical  shifts.  Zwahlen  et  al  have  recently  described 
a  3D  constant  time  13C-NOESY  pulse  sequence  specifically 
designed  for  identification  of  methyl-methyl  NOEs  in  selectively 
methyl  protonated  samples41.  A  modified  version  of  this  sequence 
(M.K.R.,  unpublished),  in  which  both  methyl  lH  and  13C  chemi¬ 
cal  shifts  are  recorded  prior  to  NOESY  transfer  (the  latter  in  con¬ 
stant-time  mode),  followed  by  acquisition  of  all  protons  in  the 
direct  dimension  was  particularly  useful  in  assignment  of  methyl- 
aromatic  NOEs  in  the  DH  domain.  High  resolution  in  all  dimen¬ 
sions  of  the  spectrum  enabled  assignment  of  101  crosspeaks 
involving  36  residues.  In  total,  the  methyl/aromatic  sample  pro¬ 


vided  an  initial  set  of  261  restraints.  These,  together  with  665 
amide  and  65  additional  aliphatic  short-  and  medium-range 
restraints  (protons  separated  by  0-1  or  2-5  residues  respectively), 
100  hydrogen  bond  restraints  (assigned  based  on  slowly  exchang¬ 
ing  amides  in  CSI-defined42  helical  regions),  116  (j)  and  116  \|/ 
restraints  (see  the  Methods),  were  sufficient  to  define  the  global 
fold  of  the  domain.  Calculations  performed  without  aromatic 
NOE  restraints  did  not  yield  convergent  structures. 

Following  determination  of  the  global  fold,  additional  NOEs 
were  assigned  in  an  iterative  fashion  using  the  program  ARIA43. 
The  final  ensemble  of  20  lowest  energy  structures  is  based  on  a 
total  of  2,507  NOE-based  restraints,  130  hydrogen  bond 
restraints,  232  backbone  dihedral  restraints  and  52  side  chain 
dihedral  restraints.  Residues  81-92  and  274-289  show  no  long 
range  NOEs  and  exhibit  intermediate  (81-92,  274-280)  or  fast 
(281-289)  exchange,  indicating  they  do  not  have  a  discreet  struc¬ 
ture  in  the  protein.  Coordinate  precision  (defined  as  the  root- 
mean-square  (r.m.s.)  deviation  from  the  average  structure)  for 
residues  93-273  in  the  20  final  conformers  is  1.07  A  for  backbone 
atoms,  and  1.64  A  for  all  heavy  atoms.  In  regions  of  secondary 
structure  these  values  are  0.82  A  and  1.26  A  respectively. 

DH  domain  structure 

The  DH  domain  is  a  flattened,  elongated  bundle  of  1 1  a-helices 
(approximate  dimensions  60  A  x  30  A  x  20  A),  designated  A-K  in 
Fig.  4.  Seven  of  these  can  be  grouped  into  five  segments  that  align 
approximately  along  the  long  axis  of  the  protein.  Segment  1  consists 
of  helix  A,  segment  2  of  helix  D,  segment  3  of  helices  E,  F  and  G,  seg¬ 
ment  4  of  helices  I  and  J,  and  segment  5  of  helix  K.  Segments  1-4  are 
organized  in  an  up-and-down  bundle,  roughly  around  the  corners 
of  a  parallelogram  (Fig.  4b).  Helix  K  packs  at  one  side  of  this  poly¬ 
gon  in  a  groove  formed  by  helices  E/F  and  J,  resulting  in  an  overall 
trapezoidal  shape  when  viewed  down  the  long  axis  of  the  protein 
(Fig.  4b).  Segments  1  and  4  are  the  most  highly  conserved  regions  of 
the  protein,  and  along  with  helix  K  form  one  elongated  face  of  the 
structure.  The  opposite  face,  formed  by  segments  2  and  3,  is  much 
less  conserved,  with  segment  3  in  particular  showing  large  variabili¬ 
ty  in  residue  composition  and  length  of  the  E-F  linker.  The  structure 
shows  only  minimal  similarity  to  proteins  in  the  structural  database 
(maximum  DALI44  Z-score  =  3.4),  and  is  unrelated  to  the  exchange 


Fig.  3  Stereo  view  of  the  backbone  (N,  Ca,  C1)  of  20  final  superimposed 
structures  of  the  (3PIX  DH  domain.  Residues  81-92  and  274-289  of  the 
construct  are  disordered  in  solution,  not  defined  by  the  NMR  data,  and 
have  been  removed  for  clarity. 
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factors  for  other  small  GTPases,  including  MSS445,  SEC746*47, 
RCC148  and  SOS49. 

DH-PH  Relationship 

In  order  to  identify  potential  contacts  between  the  DH  and  PH 
domains  in  pPIX,  1H/15N  correlation  TROSY50  spectra  of  2H/15N- 
labeled  DH  and  DH-PH  (residues  82  to  406)  constructs  were 
obtained.  At  38  °C,  DH  resonances  in  the  two  data  sets  (repeated 
on  two  separate  samples  each)  showed  only  small  deviations  (8  = 
((A!H  p.p.m.)2  +  (A  15N  p.p.m./9.8)2)1/2)  <  0.03  p.p.m.)  for  all 
amides  in  the  structured  portion  of  the  domain.  A  pattern  of 
slightly  larger  changes  (0.02  p.p.m.  <  8  <  0.03  p.p.m.)  was 
observed  for  residues  near  the  F-G  junction,  which  is  adjacent  to 
the  DH  C-terminus  in  the  structure.  However,  the  magnitude  of 
these  differences  is  clearly  not  indicative  of  a  packed  interface 
between  the  DH  domain  and  residues  of  the  PH  or  linker  regions. 
This  notion  is  further  supported  by  the  observation  that  in 
1H/15N  HSQC  spectra  of  the  DH-PH  construct,  DH  resonances 
are  uniformly  broader  and  weaker  than  those  of  the  PH  domain 
and  linker.  Thus,  in  the  absence  of  additional  factors  (such  as  a 
GTPase)  the  DH  and  PH  domains  of  pPIX  are  not  packed  togeth¬ 
er  into  a  single  unit,  but  rather  behave  more  like  two  independent 
structural  domains  in  solution. 

Mutagenesis 

The  structure  explains  a  large  body  of  mutagenesis  data  obtained 
by  ourselves  and  others  on  a  variety  of  DH  domains.  Many  of  the 
described  inactivating  mutations  clearly  act  through  disruption 
of  structure.  The  disease- causing  mutation  in  the  faciogenital 
dysplasia  1  (FGD1)  gene  results  in  a  ffameshift  at  codon  465,  and 
premature  chain  termination  four  residues  later13.  The  site  of 


b 


Fig.  4  a,  Ribbons80  depiction  of  a  representative  conformer  from  the  final 
ensemble  of  DH  domain  structures.  Side  chains  of  residues  Thr  104  and  Gin 
271/Glu  272,  whose  mutation  in  UNC-73  and  Dbl  respectively,  abrogates 
GEF  activity  are  shown  in  red.  Side  chains  of  conserved  residues  in  the 
putative  GTPase  interaction  site  that  are  discussed  in  the  text  are  blue.  Side 
chains  of  residues  lining  the  depression  between  helices  J  and  K  are  green. 
The  N-  and  C-termini  are  labeled  in  blue.  Orientation  is  the  same  as  in 
Fig.  3.  b,  View  of  the  DH  domain  down  the  long  axis  of  the  molecule, 
rotated  90°  about  a  horizontal  axis  relative  to  (a). 


mutation  corresponds  to  Lys  180  in  PPIX,  which  lies  at  the  pre¬ 
dicted  N-  terminus  of  helix  E.  Thus,  the  FGD1  mutant  is  a  severe¬ 
ly  truncated  protein  that  is  unable  to  fold.  Similarly,  the 
Dbl-inactivating  deletions  of  residues  505-518  and  664-670 
(pPIX  residues  102-115  and  261-267)  reported  by  Hart  et  al 
would  eliminate  portions  of  the  A  and  K  helices  respectively, 
again  destabilizing  the  fold  of  the  protein20.  Mutation  of  Leu  213 
to  Gin  in  the  Vav  oncogene  eliminates  both  transforming  poten¬ 
tial  and  the  ability  of  the  protein  to  activate  PAK  kinase  through 
Racl51.  This  residue  corresponds  to  the  highly  conserved  Leu  1 12 
in  pPIX,  which  is  located  on  the  buried  face  of  helix  A,  and  makes 
numerous  contacts  in  the  protein  hydrophobic  core  to  sidechains 
in  helices  D  and  I.  Replacement  with  a  hydrophilic  side  chain 
would  likely  disrupt  the  core  and  prevent  folding  of  the  domain. 

Other  mutations  appear  to  have  more  direct  effects  on  GTPase 
binding  or  GEF  catalysis,  as  they  involve  conserved  residues  that 
are  solvent  exposed  in  the  structure.  We  have  previously  reported 
that  an  inactivating  allele  of  UNC-73,  a  gene  necessary  for  neu¬ 
ronal  pathfinding  in  C.  elegans ,  contains  a  mutation  in  codon 
1216  that  changes  Ser  to  Phe  in  the  DH  domain  of  the  protein23. 
The  mutation  also  blocks  the  ability  of  purified  recombinant 
UNC-73  expressed  in  insect  cells  to  activate  Racl  in  vitro.  This 
residue  (Thr  104  in  pPIX)  is  either  Ser  or  Thr  in  virtually  all 
known  DH  domains  and  is  located  on  the  exposed  face  of  helix  A, 
adjacent  to  helix  J  (Figs.  4,  6).  Continued  analyses  of  UNC-73 
have  revealed  that  mutation  of  Ser  1216  to  Thr  has  no  effect  on 
the  ability  of  the  recombinant  protein  to  catalyze  nucleotide 
exchange  on  Rac  (Fig.  5a).  However,  mutation  to  either  Ala  or 
Glu  resulted  in  elimination  of  any  significant  activity  at  the  pro¬ 
tein  concentrations  tested.  These  data  indicate  that  the  Ser/Thr  at 
the  position  corresponding  to  Thr  104  in  PPIX,  and  in  particular 
the  hydroxyl  group  of  this  residue,  is  likely  to  mediate  interactions 
with  Racl  that  are  necessary  for  GEF  catalysis. 

To  further  delineate  regions  of  DH-GTPase  interaction,  we 
have  also  prepared  a  battery  of  mutations  in  the  DH  domain  of 
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Fig.  5  a,  Effect  of  Ser  1216  (corresponding  to  Thr  104  in  pPIX) 
mutations  on  UNC-73  DH  activity  towards  mammalian  Rac  GTPase. 
3H-GDP  release  from  GST/Raci  was  measured  in  the  presence  of 
insect  cell-expressed  GST/UNC-73  DH1-PH1  (WT),  and  the  indicated 
mutants  at  residue  1216.  b,  Guanine  nucleotide  exchange  activi¬ 
ties  of  the  DH  domain  mutants  of  Dbl  toward  Cdc42.  The  ability  of 
the  insect  cell-expressed,  purified  GST-Dbl  mutants  to  catalyze  the 
release  of  3H-GDP  from  the  GTPase  was  measured.  Similar  results 
were  obtained  for  the  GEF  activity  of  these  mutants  toward  Rho. 
The  experiment  shown  is  representative  of  at  least  two  indepen¬ 
dent  assays  using  proteins  from  different  preparations.  Both  bar 
graphs  represent  the  amount  of  3H-GDP  bound  to  the  GTPase  after 
20  min  (Racl/UNC-73)  or  5  min  (Cdc42/Dbl)  in  the  presence  of  the 
exchange  factor. 


Dbl.  Recombinant  proteins  were  expressed  in  baculovirus  infect¬ 
ed  insect  cells,  purified  and  assayed  for  GEF  activity  toward 
Cdc42  and  Rho.  Essentially  identical  results  were  obtained  for  the 
two  GTPases,  and  only  data  on  the  former  are  described  here. 
Most  of  the  mutants  show  GEF  activity  identical  to  wild  type  (Fig. 
5b).  Thus,  double  mutants  N540A/K541A,  Q615A/R616A  and 
K643A/E644A,  which  correspond  to  changes  of  N133/I134, 
K21 1/G212  and  K240/E241  in  pPIX  are  fully  active.  Results  simi¬ 
lar  to  the  last  of  these  have  also  been  reported  in  an  analysis  of  the 
S.  cerivisiae  Cdc24  gene.  There,  mutation  of  Lys  442  to  Glu  or  Glu 
423  to  Lys  (PPIX,  K240E  or  E241K)  resulted  in  no  observable 
phenotype,  in  contrast  to  the  severe  defects  in  budding  observed 
on  disruption  of  the  gene52.  The  three  double  mutants  are  located 
in  solvent  exposed  regions  of  helix  C,  the  G-H  loop,  and  C-termi- 
nal  end  of  helix  J  respectively,  and  indicate  that  these  regions  do 
not  play  a  significant  role  in  mediating  GEF  catalysis.  Single 
residue  changes  of  H556A,  E565A,  Q586A  and  E607A,  corre¬ 
sponding  to  Q148A,  E156A,  T181A  and  E202A,  in  PPIX  are  also 
without  effect  on  GEF  activity  toward  Cdc42  or  RhoA.  These 
changes  involve  exposed  residues  at  the  center  and  C-terminus  of 
helix  D,  near  the  N-terminus  of  helix  F  and  at  the  N-terminus  of 
helix  G  respectively,  indicating  these  regions,  too,  are  uninvolved 
in  nucleotide  exchange  catalysis. 

One  double  mutant,  N673A/D674A,  did  show  severely 
impaired  GEF  activity  (Fig.  5b).  These  residues  correspond  to  Gin 
271  and  Glu  272  in  PPIX,  which  are  located  on  the  solvent 
exposed  face  of  helix  K  at  its  C-terminal  end  (Figs  4,  6).  The  first 
is  highly  conserved  throughout  the  Dbl  family,  being  an  Asn  in 
>75%  of  the  reported  sequences.  Notably,  PPIX  is  an  exception  to 
this  trend,  having  a  Gin  at  this  position.  The  second  residue,  Asp 
674,  is  either  an  Asp  or  a  Glu  in  the  majority  of  DH  domains.  The 
high  conservation  of  these  exposed  residues  and  loss  of  activity 


on  mutation  suggests  that  either  or  both  play  a  direct  role  in  GEF 
activity  of  Dbl  toward  Cdc42  and  RhoA. 

GTPase  interaction  site 

The  mutagenesis  data  above  have  identified  two  sites  on  the  sol¬ 
vent  exposed  surface  of  the  A  and  K  helices  that  are  likely  to  medi¬ 
ate  DH  domain  interactions  with  GTPases  during  GEF  catalysis. 
They  have  also  excluded  sites  on  the  C,  D,  F  and  G  helices,  the 
G-H  loop  and  the  C-terminus  of  helix  J  from  direct  participation 
in  the  nucleotide  exchange  process.  In  order  to  identify  other 
potential  DH  domain-GTPase  contact  sites  we  analyzed  patterns 
of  surface  residue  conservation  throughout  the  Dbl  family.  A  sur¬ 
face  of  the  DH  domain,  colored  according  to  degree  of  conserva¬ 
tion  among  the  proteins  listed  in  Fig.  1,  is  shown  in  Fig.  6.  There  is 
pronounced  clustering  of  conserved  residues  on  one  of  the  long 
faces  of  the  protein,  composed  of  segments  1,  4  and  5.  The  most 
conserved  region  lies  between  Thr  104  and  Q271/E272,  the  con¬ 
tact  residues  identified  through  mutagenesis.  On  this  basis,  we 
conclude  that  the  surface  of  the  DH  domain  bounded  by  the  A 
and  K  helices  horizontally  in  Fig.  4  and  spanning  approximately 
the  middle  third  of  the  molecule  vertically  is  likely  to  be  the 
GTPase  contact  site. 

The  central  portion  of  the  interaction  site  consists  of  solvent 
exposed  side  chains  in  the  I-J  loop  and  helix  J.  Residues  at  posi¬ 
tions  227,  230,  234  and  237  are  all  directed  into  solvent,  and  are 
conserved  across  the  Dbl  family.  The  first  three  of  these  are 
hydrophilic  in  character,  with  227  being  Lys,  230  Gin  and  234  Lys 
or  Arg  in  most  DH  domains.  It  is  notable  that  position  230  is  par¬ 
ticularly  highly  conserved,  yet  is  Met  in  pPIX.  Residue  237  is 
hydrophobic,  being  Leu  or  Val  in  most  DH  domains,  but  again  is 
distinct  in  PPIX,  which  has  a  Thr  at  this  position.  These  differ¬ 
ences,  along  with  that  at  residue  271  mentioned  above,  may 
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Fig.  6  a,  Molecular  surface  of 
the  DH  domain  oriented  as  in 
Fig.  3.  Residues  are  color 
coded  according  to  sequence 
conservation  among  the  pro¬ 
teins  listed  in  Fig.  1 .  The  color 
scale  varies  from  white,  rep¬ 
resenting  conservation  in 
three  or  less  Dbi  domains,  to 
red,  representing  conserva¬ 
tion  in  all  nine  domains. 
Helices  are  labeled  in  black, 
and  the  N-  and  C-termini  are 
labeled  in  white  b,  View  fol¬ 
lowing  rotation  of  the  sur¬ 
face  by  180°  about  a  vertical 
axis.  Figures  were  created 
using  the  program  GRASP81. 


account  for  the  low  GEF  activity  of  (3PIX  observed  by  ourselves 
and  others  (B.A.,  M.K.R.,  unpublished  results;  refs  30,  31).  The 
break  between  helices  I  and  J  occurs  near  one  end  of  the  con¬ 
served  site  and  appears  to  be  caused  by  Pro  228.  This  residue, 
whose  side  chain  is  completely  buried,  is  nearly  invariant  in  DH 
domains.  The  structure  imparted  by  the  break  in  the  helices  may 
thus  be  important  in  organization  of  the  GEF  active  site. 
Structure  in  the  J  helix  may  also  be  important  for  catalysis  based 
on  observations  that  in  Dbl,  Cdc24  and  Vav,  the  conservative 
mutation  of  residues  corresponding  to  237-243  in  (3PIX 
(LLLKELL  in  Dbl)  to  IIIRDII  abolishes  GEF  activity20’51’53. 
Residues  Lys  240  and  Glu  241  ((3PIX  numbering)  in  this  sequence 
can  be  mutated  in  Dbl  without  decreasing  GEF  activity.  Thus, 
since  substitution  of  Leu  with  lie  would  not  be  expected  to  gross¬ 
ly  destabilize  DH  structure,  the  inhibitory  effects  of  this  mutation 
suggest  that  GEF  catalysis  may  be  particularly  sensitive  to  the 
structure  of  the  J  helix. 

The  conserved  residues  in  helix  J  are  flanked  on  one  side  by 
exposed  side  chains  in  helix  A.  Many  of  these  are  also  conserved, 
and  the  majority  again  are  hydrophilic.  Residues  100  and  104  are 
Glu  and  Thr/Ser  in  most  DH  domains.  Their  side  chains  contact 
those  at  positions  237, 234  and  230,  forming  a  continuous  region  of 
conserved  surface.  Beneath  this  surface,  a  nearly  invariant  internal 
salt  bridge  involving  pPIX  residues  Glu  105  and  Arg  231  serves  to 
stabilize  packing  between  the  A  and  J  helices,  and  maintain  the 
structure  of  the  active  site. 

Helix  K  packs  against  the  opposite  side  of  helix  J.  While  the 
majority  of  this  C-terminal  segment  has  relatively  variable 
sequence,  position  271  is  Asn  in  most  cases.  This  residue  is  separat¬ 
ed  from  conserved  residues  227  and  230  by  a  shallow  depression 
between  the  N-terminus  of  helix  J  and  the  C-terminus  of  helix  K. 
This  feature  is  lined  by  the  side  chains  of  Ser  226,  Phe  229,  Leu  232, 
Ser  267,  Cys  270,  Cys  187  and  His  190.  Although  polar  in  pPIX, 
sequence  alignment  shows  that  the  depression  will  be  highly 
hydrophobic  in  most  other  DH  domains  (Fig.  1).  In  particular, 


side  chains  at  positions  226,  229, 232,  267  and  270  are  aliphatic  or 
aromatic  in  nearly  all  cases.  The  active  sites  of  the  SEC7  and  Cdc25 
GEF  domains  both  contain,  in  addition  to  hydrophilic  residues, 
important  hydrophobic  clusters  that  make  key  contacts  to  residues 
in  the  switch  II  regions  of  their  GTPase  targets.  The  depression 
between  helices  J  and  K  is  the  only  comparable  region  on  the  con¬ 
served  face  of  the  DH  domain.  Thus,  it  is  possible  that  this  region  of 
the  DH  domain  may  make  similarly  important  contacts  to 
hydrophobic  portions  of  the  Rho  GTPases  during  GEF  catalysis. 

The  exposed  side  chains  of  helices  A,  I/f  and  K  form  a  slightly 
convex,  largely  hydrophilic  surface  with  an  adjacent  depression 
that  is  hydrophobic  in  most  DH  domains.  This  region  is  the  likely 
GEF  active  site  of  the  DH  domain.  Importantly,  the  site  is  located 
immediately  adjacent  to  the  C-terminus  of  the  folded  structure. 
Thus,  the  sites  of  PH  attachment  and  GTPase  binding  are  proxi¬ 
mal.  This  suggests  a  means  of  communication  between  the  two 
domains  that  could  be  used  to  modulate  GTPase  interactions  and 
GEF  activity  in  vivo. 

Functional  implications 

The  nearly  universal  pairing  of  DH  with  PH  suggests  that  the 
activities  of  the  two  domains  may  be  functionally  coupled. 
Evidence  for  this  idea  has  been  provided  in  analyses  of  the 
DH-PH  tandem  of  SOS.  Structures  of  the  PH  domains  from 
human  and  mouse  SOS  contain  an  additional  N-terminal  a-helix 
not  found  in  other  PH  domains54*55.  This  element,  which  appears 
to  be  part  of  the  DH-PH  linker  in  the  full  length  protein,  packs 
very  near  the  phospholipid  binding  site  of  the  PH  domain.  This 
observation  has  led  to  suggestions  that  in  full  length  SOS,  the  DH 
and  PH  domains  may  contact  each  other  in  a  manner  modulated 
by  lipid  binding,  and  that  this  modulation  could  play  a  role  in 
regulating  GEF  activity54,55.  Interdomain  regulation  is  supported 
by  the  discovery  of  Nimnual  et  al  that  expression  of  the  SOS  DH 
domain  alone  in  COS-1  cells  results  in  activation  of  Rac,  while  the 
DH-PH  tandem  has  this  effect  only  when  coexpressed  with  acti- 
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Table  1  Structural  statistics1 

Restraints  for  structure  calculation 

Total  restraints  used 

2,847 

Total  NOE  restraints 

2,507 

Total  Unambiguous  Restraints 

2,263 

Intraresidue 

1,212 

Sequential 

566 

Medium  range 

284 

Long  range 

201 

Total  Ambiguous  Restraints 

244 

Hydrogen  bond  restraints 

130 

Dihedrals  <}) 

116 

Dihedrals  vy 

116 

Dihedrals  %  (Thr,  Leu,  Val,  He) 

52 

Statistics  for  structure  calculations 

R.m.s.d.  from  experimental  restraints2 
Distances  (A) 

0.019  A  ±0.002 

Angles  (°) 

0.29°  ±0.17 

Final  Energies  (kcal  moh1) 

Etotal 

474  ±  28 

^bonds 

17  ±  4 

^angles 

148  ±12 

EvdW 

35  ±3 

Enoe 

47  ±8 

Coordinate  precision3 

R.m.s.d.  of  Backbone  Atoms  (N,  Ca,  C') 

Helices 

0.79  A  ±0.12 

Residues  13-193 

1 .09  A  ±  0.16 

R.m.s.d.  of  all  Heavy  Atoms 

Helices 

1.24 

Residues  13-193 

1.65 

Procheck 

Residues  in  most  favored  region 

Helices 

86% 

Residues  13-193 

79% 

Residues  in  additional  allowed  region 

Helices 

13% 

Residues  13-193 

17% 

Residues  in  generously  allowed  region 

Helices 

0.6% 

Residues  13-193 

2.3% 

Residues  in  disallowed  region 

Helices 

0.8% 

Residues  13-193 

1.8% 

’The  final  values  for  the  force  constants  employed  for  the  various  terms 
in  the  target  function  employed  for  simulated  annealing  are  as  follows: 
5  kcal  moh1  A-4  for  the  quartic  van  der  Waals  repulsion  term  (with  van  der 
Waals  radii  set  to  0.78  times  their  value  defined  in  the  paramallhdg-pro 
parameter  set),  50  kcal  moHA"2  for  the  experimental  distance  restraints, 
200  kcal  moh1  rad-2  for  the  experimental  torsion  angle  restraints.2None  of 
the  final  structures  have  distance  restraint  violations  greater  than  0.3  A 
or  dihedral  violations  greater  than  5*. 

3Defined  as  average  root-mean-squared  difference  for  the  stated 
residues  between  the  final  20  structures  and  the  average  coordinates. 


vated  Ras56  that  is  capable  of  stimulating  phosphatidyl  inositol 
3’-kinase  (PI3'K).  On  this  basis  it  was  postulated  that  binding  of  the 
lipid  product  of  PI3'K,  phosphatidylinositol-3,4, 5-triphosphate,  to 
the  SOS  PH  domain  may  enhance  DH  domain  GEF  activity.  Such 
phenomena,  however,  are  not  universal.  The  DH  and  PH  domains 
of  pPIX  clearly  act  as  independent  units  in  a  tandem  construct  in 
solution.  It  is  possible  that  the  behavior  observed  here  is  specific  to 
this  system,  since  the  pPIX  splice  variant  mentioned  above  is  the 


only  known  Dbl  protein  lacking  a  PH  domain.  But  data  on  other 
Dbl  proteins  suggest  this  is  not  so.  For  example,  the  Lbc  DH 
domain  alone  can  stimulate  DNA  synthesis  and  actin  stress  fiber 
formation57.  Alternatively,  while  the  transforming  activity  of  the 
Lfc  oncogene  can  be  eliminated  through  removal  of  its  PH  domain, 
activity  is  retained  when  the  domain  is  substituted  by  a  prenylation 
sequence58,  indicating  the  PH  domain  plays  either  both  inhibitory 
and  targeting  roles  or  simply  the  latter  in  mediating  transforma¬ 
tion.  Thus,  although  the  structural  data  presented  here  demon¬ 
strate  the  potential  for  DH-PH  interactions  in  the  tandem, 
realization  of  this  potential  evidently  does  not  occur  in  all  cases.  In 
fact,  these  interactions  themselves  may  be  regulated,  either  by  the 
additional  domains  in  the  molecule,  or  through  binding  of  addi¬ 
tional  molecules.  In  this  regard,  it  is  interesting  that  pPIX  has  been 
found  in  a  stable  ternary  complex  with  activated  Cdc42  and  PAK30. 
Structural  and  biochemical  analyses  of  this  complex  may  yield 
additional  insights  into  the  control  of  GEF  activity  in  vivo . 

Sequence  alignments  show  that  there  are  no  completely  invari¬ 
ant  residues  in  the  DH  domain.  Even  residues  in  the  GTPase 
interaction  site  show  some  variation  across  the  family.  This  may 
reflect  both  the  nature  of  nucleotide  exchange  catalysis  and  differ¬ 
ences  in  activity  and  regulation  of  the  Dbl  family  members.  A 
fundamental  feature  of  the  exchange  process  appears  to  be  stabi¬ 
lization  of  a  nucleotide-free  form  of  the  GTPases.  This,  in  turn 
decreases  the  affinity  of  the  GEF-GTPase  complex  for  nucleotides 
through  an  increase  in  nucleotide  dissociation  rate.  GTPase  load¬ 
ing  can  then  rapidly  equilibrate  to  a  composition  that  depends  on 
the  intrinsic  affinities  for  GTP  and  GDP  (which  varies  apprecia¬ 
bly  among  the  Ras  superfamily  members)  and  local  cellular  con¬ 
centrations  of  the  two  nucleotides27*59.  Recent  structures  of  the 
Ras-SOS49  and  Arf-Gea260  complexes  have  illustrated  that  the 
mechanics  underlying  these  thermodynamic  changes  are  com¬ 
plex.  In  both  cases,  recognition  occurs  through  extensive  contacts 
centered  on  the  switch  II  region  of  the  GTPase  and  extending  to 
switch  I  and  the  a3  helix.  Switch  regions  I  and  II  are  extensively 
rearranged  on  complexation,  opening  the  nucleotide  pocket  dra¬ 
matically.  Nucleotide  and  Mg2+  binding  sites  are  also  disrupted 
either  through  direct  insertion  of  GEF  residues,  conformational 
changes  in  specific  GTPase  side  chains,  or  both.  It  is  not  clear 
which  of  the  interactions  with  GEF  drive  the  conformational 
changes,  and  therefore  drive  catalysis,  and  which  simply  result 
from  optimization  of  energetics  in  the  nucleotide-free,  distorted 
state.  For  the  SEC7  domain,  approximately  103  of  the  total 
105-fold  increase  in  nucleotide  dissociation  rate  can  be  attributed 
to  insertion  of  a  single  Glu  side  chain  into  the  nucleotide  phos¬ 
phate  binding  pocket60*61.  In  the  case  of  SOS,  however,  it  appears 
from  sequence  alignments  that  the  interactions  may  be  distrib¬ 
uted,  with  no  single  residue  being  absolutely  essential  to  the 
process49.  Dbl  family  sequence  variability  may  then  reflect  a  simi¬ 
lar  situation  in  DH-mediated  GEF  catalysis. 

Mutagenesis  studies  indicate  that  mechanisms  of  DH  action 
may  also  be  significantly  different  from  those  observed  for  SOS 
and  Gea262.  Thus  sensitivity  of  Rho  and  Cdc42  toward  Lbc  and 
Cdc24  respectively,  is  unaffected  by  extensive  mutations  through¬ 
out  the  switch  II  regions.  In  contrast,  mutations  in  switch  I  signif¬ 
icantly  decrease  GEF  responsiveness.  Moreover,  individual 
mutations  in  switch  I  have  different  effects  on  responsiveness  to 
different  GEFs.  For  example  Rho  mutations  T35A  and  F39E  abro¬ 
gate  GEF  activity  of  Lbc,  but  do  not  effect  that  of  Dbl62.  Thus,  it 
appears  that  DH  catalyzed  exchange  may  have  certain  character¬ 
istics  that  are  mechanistically  distinct  from  those  of  other  GEFs, 
and  that  even  within  the  family  there  may  be  more  subtle  differ¬ 
ences  in  recognition.  The  observed  variability  in  DH  sequence 
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could  then  also  reflect  these  issues.  Because  of  these  complexities 
of  DH-mediated  exchange  catalysis,  complete  understanding  of 
Dbl  family  GEF  activity  will  require  structures  of  at  least  several 
family  members  both  free  and  in  complex  with  their  GTPase  sub¬ 
strates.  The  work  described  here  provides  a  first  step  in  this  direc¬ 
tion,  and  establishes  a  foundation  for  further  study  of  the 
biochemical  and  biological  functions  of  the  Dbl  proteins. 

Note  added  in  proof:  While  this  manuscript  was  under  review  the 
structures  of  the  DH-PH  tandem  from  SOS63  and  the  N-terminal 
DH  domain  of  Trio64  were  reported .  Both  structures  are  very  similar 
to  the  one  described  here . 

Methods 

NMR  sample  preparation.  Residues  82-289  (DH  domain  construct) 
or  82-406  (DH-PH  tandem  construct)  of  pPIX  were  amplified  from 
the  full-length  cDNA  by  PCR,  subcloned  into  the  pETIIa  expression 
vector  (Novagen)  and  transformed  into  E.coli  strain  BL21(DE3). 
Uniform  15N  and  13C  labeling  was  obtained  by  growing  cells  in  mini¬ 
mal  media  supplemented  with  15NH4CI  and/or  13C6-glucose  as  the  sole 
nitrogen  and  carbon  sources  respectively.  Random  fractional  deuter- 
ation  was  obtained  by  growing  cells  in  media  containing  50-75% 
D20.  Full  deuteration  at  all  proton  positions  was  obtained  by  grow¬ 
ing  cells  in  100%  D20  media  supplemented  with  13C62H7-glucose 
(Cambridge  Isotope  Laboratories)  as  the  sole  carbon  source.  Selective 
protonation  at  methyl  positions  of  leucine,  valine  and  isoleucine  was 
obtained  as  described35  by  addition  of  60  mg  M  of  13C,15N,2-2H 
(50%),3-2H  valine  (Cambridge  Isotope  Laboratories)  and  -40  mg  M 
13C,3-2H2  a-ketobutyrate  to  fully  deuterated  media  30  min  before 
induction.  Similarly,  addition  of  50  mg  M  unlabeled  protonated  tyro¬ 
sine  and  phenylalanine  led  to  the  protonation  of  the  side  chains  of 
these  aromatics.  Cells  were  grown  at  37  °C  to  an  optical  density  of 
0.8  at  600  nm.  Protein  expression  was  induced  for  12  h  with  1  mM 
isopropyl-D-thiogalactoside  at  25  °C  (H20  and  partially  deuterated 
growths)  or  30  °C  (100%  deuterated  media).  Protein  was  purified 
from  bacterial  lysate  by  successive  weak  anion  exchange  (Pharmacia 
DEAE-CL6B),  strong  anion  exchange  (Pharmacia  Mono-Q)  and  gel  fil¬ 
tration  (Pharmacia  Superdex-75)  chromatographies.  NMR  samples 
contained  approximately  1.3  mM  protein  in  20  mM  phosphate 
buffer  pH  7,  150  mM  NaCl,  1  mM  EDTA,  3  mM  DTT,  0.1%  NaN3. 

Dbl  and  UNC-73  mutagenesis  and  protein  expression.  Wild  type 
and  mutant  cDNAs  (generated  by  PCR  and  verified  by  automated 
sequencing  in  both  directions)  encoding  the  DH  and  PH  domains  of  Dbl 
(residues  498-825)  were  fused  to  a  cDNA  encoding  glutathione 
S-transferase  (GST)  and  inserted  into  the  baculovirus  transfer  vector 
pVL1393  (Invitrogen).  Recombinant  baculovirus  was  produced  by  trans¬ 
fection  into  Sf9  insect  cells  using  the  BaculoGold  kit  (Pharmingen). 
Expression  and  purification  of  GST-fusion  small  GTP-binding  proteins  in 
E  coli  and  the  GST-fusion  Dbl  mutants  in  Sf9  cells  were  performed  as 
described65.  Expression  and  purification  of  UNC-73  proteins  spanning 
residues  1179-1638  (DH-1/PH-1/SH3)  were  as  described23. 

Nucleotide  exchange  assays.  In  assays  of  UNC-73,  2  pig  of  GST-Rac 
was  preloaded  with  3H-GDP,  then  mixed  in  reaction  buffer  with  0.5  jig 
of  the  exchange  factor.  The  amount  of  3H-GDP  bound  to  Rac  at  time 
point  0  min  was  used  to  determine  100%  GDP-bound.  Dbl-induced  3H- 
GDP/GTP  exchange  on  Cdc42Hs  and  RhoA  was  measured  at  24  °C  as 
described65  using  -0.5  pg  purified  GST-Dbl  mutants  and  1  pg  GTPase. 

NMR  spectroscopy.  NMR  experiments  were  carried  out  at  38  °C  on  a 
Varian  Innova  600  MHz  spectrometer  equipped  with  four  channels 
and  pulsed  field  gradients.  Data  were  processed  using  NMRpipe66  and 
analyzed  with  NMRview  software  packages67.  Backbone  ’HN,  15N, 
13Ca,  13C|3  and  13CO  assignments  were  obtained  for  the  DH  domain 
through  the  following  2H-decoupled,  triple  resonance  spectra  applied 
to  a  2H(75%)/15N/13C-labe!ed  sample:  CT-HNCa,  CT-HN(CO)Ca, 
HN(Ca)Cb,  HN(COCa)Cb68,  HNCO69-71  and  HN(Ca)C072.  Assignments 
were  confirmed  by  HNCaHa69-73  and  Ha(CaCO)NNH69-74  spectra 
acquired  on  a  fully  protonated  15N/13C-labeled  sample,  and  through 


sequential  HN-HN  and  HN-Ha  NOEs  observed  in  an  15N-edited  NOESY 
spectrum75.  Side  chain  resonances  were  assigned  on  a  fully  protonat¬ 
ed  sample  through  HCCH-TOCSY33,  CCC-TOCSY-NNH  and  HCC-TOCSY- 
NNH32  experiments.  Some  side  chain  ’H  and  13C  assignments  were  also 
obtained  through  the  latter  experiment  applied  to  a  2H(75%)/15N/13C- 
labeled  sample.  All  leucine,  valine  and  isoleucine  (5)  methyl  ’H  and  13C 
assignments  were  obtained  using  the  selectively  methyl/aromatic 
protonated  sample  described  in  the  text  using  optimized  versions  of 
the  HCC-  and  CCC-TOCSY-NNH  experiments39.  All  Phe  and  Tyr  aromat¬ 
ic  proton  assignments  were  obtained  through  homonuclear  ’H-’H 
TOCSY  and  NOESY  experiments  applied  to  the  same  sample. 

The  following  NOESY  spectra  were  used  to  generate  distance 
restraints  for  structure  calculation:  3D  15N-edited  NOESY  (40  ms,  70  ms; 
15N-labeled  sample);  3D  13C-edited  NOESY,  3D  simultaneous  15N/13C- 
edited  NOESY  (both  75  ms;  15N/13C-labeled  sample);  3D  constant  time 
13C-edited  NOESY  (175  ms),  2D  ’H  NOESY  (175  ms),  4D  13C/13C  edited 
NOESY  (175  ms)  and  4D  13C/15N  edited  NOESY  (175  ms  mixing  time)  all 
recorded  on  the  selectively  methyl/aromatic  protonated  sample. 

A  total  of  38  %1  (±20-40°)  and  14  dihedral  restraints  (±40°,  Leu 
residues  only)  were  identified  based  on  patterns  of  intraresidue  and 
sequential  NOEs  in  the  15N-  and  13C-edited  NOESY  spectra.  Stereospecific 
Hp  assignments  for  11  Leu  residues  were  obtained  similarly. 
Stereospecific  assignments  of  valine  and  leucine  methyl  groups  were 
obtained  from  a  CT-WK  HSQC  spectrum  of  a  10%  13C-labe!ed  sample76. 

Structure  calculations.  Initial  structures  were  calculated  using  a 
standard  torsion  angle  simulated  annealing  dynamics  protocol 
implemented  in  the  program  CNS  version  0.377.  Distance  restraints  in 
these  calculations  were  obtained  from  the  3D  15N-edited  NOESY,  3D 
13C-edited  NOESY,  4D  13C/13C  edited  NOESY,  4D  ’3C/15N  and  the  3D  CT 
methyl  edited  experiments  as  described  in  the  text.  Restraints  were 
classified  as  strong  (1. 8-2.7  A,  2.9  A  for  amides)  medium  (1.8-3. 5  A) 
and  weak  (1. 8-5.0  A).  An  additional  0.5  A  was  added  to  the  upper 
bound  for  all  methyl  groups.  Helices  were  initially  identified  by 
analysis  of  the  chemical  shift  index42,  short  and  medium  range  HN- 
Ha  and  Ha-Hp  NOEs  and  backbone  HN  exchange  data.  For  the  116 
residues  that  satisfied  the  a-helix  conformation  under  these  criteria, 
<t>  and  \|/  dihedral  restraints  of  -60°  ±  30°  and  -40°  ±  50°  respectively, 
were  included.  Converged  structures  from  the  CNS  calculations  were 
then  used  as  input  to  ARIA43,  which  was  subsequently  utilized  for  the 
iterative  automated  assignment  of  all  spectra  and  refinement  of  the 
structures  using  X-PLOR  (version  3.851  )78.  Tolerances  for  the  auto¬ 
mated  assignment  of  NOE  peaks  in  ARIA  were  set  to  0.04  p.p.m.  in 
indirect  and  acquisition  proton  and  0.4  p.p.m.  in  heteroatom  dimen¬ 
sion  for  all  spectra.  Peaks  in  severely  overlapped  regions  of  the  spec¬ 
tra  were  not  included  to  avoid  false  assignments  due  to  incorrect 
positioning  of  these  peaks.  Ten  structures  were  calculated  in  each 
iteration.  The  iterative  assignment  of  NOEs  was  based  on  eight  struc¬ 
tures  with  the  lowest  number  of  violations  calculated  in  the  preced¬ 
ing  iteration.  An  assignment  was  included  if  it  could  be  satisfied  in  at 
least  50%  of  the  structures.  The  cutoff  value  of  an  ambiguous  assign¬ 
ment  was  reduced  progressively  from  0.99  for  the  first  iteration  to 
0.45  in  iteration  11.  After  each  iteration,  structures  obtained  were 
analyzed  to  ensure  that  none  of  the  restraints  utilized  in  the  initial 
CNS  calculations  were  violated.  Final  structures  were  evaluated  using 
the  program  ProcheckNMR79. 

Coordinates.  Coordinates  have  been  deposited  in  the  Protein  Data 
Bank  (accession  number  1 BY1). 
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product  review 


Oxford  Cryosystems  has  announced  the  launch  of  the  Oxford 
HeliX,  the  first  commercially  available  helium  system  for  X-ray 
crystallography  in  which  the  crystal  sample  is  cooled  in 
an  open  flow  gas  stream  rather  than  being 
enclosed  by  windows.  The  Oxford  HeliX  uti¬ 
lizes  helium  gas,  not  liquid,  from  a  helium 
cylinder  and  cools  it  to  temperatures  as 
low  as  28  K.  This  is  achieved  by  using  a 
integrated  two  stage  closed  cycle  cooler 
and  a  patented  gas  delivery  system. 

Other  benefits  of  the  Oxford  HeliX 
include:  temperature  stability  of  0.3  K, 
reduction  of  helium  gas  consumption  to 
7.5  1  min-1,  a  push  button  temperature 
controller  that  is  simple  to  use,  and  proven 
mechanical  reliability.  The  low  flow  rate  is  ^ 
designed  to  minimize  the  consumption  of  heli¬ 
um  while  still  providing  sufficient  flow  to  cool  the 
sample  and  eliminate  the  possibility  of  ice  formation. 

Oxford  Cryosystems  is  the  home  of  the  nitrogen 
gas  Cryostream  Cooler,  a  popular  flow  nitrogen 


system.  The  design  of  that  system  has  helped  to  make  the  difficult 
task  of  low  temperature  X-ray  data  collection  a  more  routine 
procedure,  and  now  the  development  of  the  Oxford 
HeliX  will  allow  routine  data  collection  at  even 
lower  temperatures. 

The  Oxford  HeliX  is  useful  for  any  crys- 
tallographer  wishing  to  collect  X-ray 
data  below  nitrogen  temperatures 
where  an  open  flow  gas  stream  is 
essential.  Experiments  in  which  this 
would  be  useful  include  charge  densi¬ 
ty  studies,  where  lower  temperatures 
offer  enhanced  data  quality,  and  the 
cooling  of  biological  samples  in  intense 
X-ray  sources,  where  lower  temperatures 
can  increase  the  life  time  of  the  crystal. 

For  more  information  on  the  Oxford  HeliX,  contact: 
Oxford  Cryosystems 
www.  OxfordCryosystems.  co.  uk 
email:  info@OxfordCryosystemsxo.uk 


Molecular  Structure  Corporation  has  developed  the  first  commercially  available 
xenon  derivatization  flash  cooling  device,  the  Cryo-Xe-Siter.  A  critical  step  in  X-ray 
crystallographic  structure  determination  of  a  macromolecule  is  the  preparation  of 
heavy  atom  derivatives  of  the  crystal.  The  noble  gas  xenon  binds  to  hydrophobic 
sites  and  can  serve  as  the  needed  heavy  atom  derivative.  However,  a  major  challenge 
to  the  widespread  use  of  xenon  has  been  the  technical  difficulty  of  preparing  the 
derivatized  crystals.  Now,  with  the  the  Cryo-Xe-Siter,  xenon  derivatives  of  protein 
crystals  can  be  produced  easily — and  at  cryogenic  temperatures. 

The  unique  two-chamber  design  allows  a  protein  crystal  to  reach  equilibrium  in 
the  xenon  atmosphere,  at  pressures  up  to  300  p.s.i.  The  crystal  is  then  flash  frozen 
while  still  under  pressure.  The  Cryo-Xe-Siter  accommodates  popular  pin  mounts 
and  has  a  small  chamber  to  minimize  xenon  consumption.  It  is  easy  to  use  and  safe 
to  operate. 


For  more  information  on  the  Cryo-Xe-Siter ,  contact: 
Molecular  Structure  Corporation 
tel  281  363  1033 
fax  281 3643628 
h  ttp'Jfwww.  msc.  com / 
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